Data Bridge: solving diverse Data Access in Scientific Applications
نویسندگان
چکیده
The nature of data for scientific computation is very diverse in the age of big data. First, it may be available at a number of locations, e.g. the scientist’s machine, some institutional filesystem, a remote service, or some sort of database. Second, the size of the data may vary from a few kilobytes to many terabytes. In order to be available for computation, data has to be transferred to the location where the computation takes place. This requires a diverse set of middleware tools that are compatible both with the data and the compute resources. However, using this tools requires additional knowledge and makes running the experiments an inconvenient task. In this paper we present the Data Bridge, a high-level service that can be used easily in scientific computations to perform data transfer to and from a diverse set of storage services. The Data Bridge not only unifies access to different types of storage services, but it can also be used at different levels (e.g., single jobs, parameter sweeps, scientific workflows) in scientific computations.
منابع مشابه
A FUZZY MODEL FOR ASSESSMENT PROCESSES
The methods of assessing the individuals’ performance usually applied in practice are based on principles of the bivalent logic (yes-no). However, fuzzy logic, due to its nature of including multiple values, offers a wider and richer field of resources for this purpose. In this paper we use principles of fuzzy logic in developing a new method for assessing the performance of groups of individua...
متن کاملNetwork Analysis of Interpersonal Relationships in Tehran Stock Exchange
The stock market has an important role in growth and development of countries. Network analysis is one of the latest method in analyzing the stock market. In quantitative science literature, It is a new concept for a macro view to whole market. Therefore, this research analyzes the interpersonal relationships’ network in the Tehran Stock Exchange (TSE). From the type of data collected and analy...
متن کاملThe IBM Research Accelerated Discovery Lab: Objectives and Experience
The IBM Research Accelerated Discovery Lab is a unique, collaborative environment specifically designed to facilitate complex analytic projects by tackling the challenges of data-intensive scientific discovery. The environment provides access to diverse data sources, unique research capabilities for analytics such as domain models, text analytics and natural language processing capabilities der...
متن کاملProteus, a Grid based Problem Solving Environment for Bioinformatics: Architecture and Experiments
Bioinformatics can be considered as a bridge between life science and computer science. Biology requires high and large computing power to performance biological applications and to access huge number of distributed and (often) heterogeneous databases. Computer scientists and database communities have expertises in high performance algorithms computation and in data management. Considering bioi...
متن کاملA Comprehensive Access Control System for Scientific Applications
Web based scientific applications have provided a means to share scientific data across diverse groups and disciplines extending beyond the local computing environment. But the organization and sharing of large and heterogeneous data pose challenges due to their sensitive nature. In this paper we analyze the security requirements of scientific applications and present an authorization model tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013